On Answering Why-Not Queries Against Scientific Workflow Provenance
نویسنده
چکیده
Why-not queries help scientists understand why a given data item was not returned by the executions of a given work�ow. While answering such queries has been investigated for relational databases, there is only one proposal in this area for work�ow provenance, viz. the Why-Not algorithm. This algorithm makes the assumption that the modules implementing the steps of the work�ow preserve the attributes of the input datasets. This is, however, not the case for all work�ow modules. We drop this assumption, and show in this paper how theWeb can be harvested to answer why-not queries against work�ow provenance.
منابع مشابه
Exploring Scientific Workflow Provenance Using Hybrid Queries over Nested Data and Lineage Graphs
Existing approaches for representing the provenance of scientific workflow runs largely ignore computation models that work over structured data, including XML. Unlike models based on transformation semantics, these computation models often employ update semantics, in which only a portion of an incoming XML stream is modified by each workflow step. Applying conventional provenance approaches to...
متن کاملEfficiently Computing Provenance Graphs for Queries with Negation
Explaining why an answer is in the result of a query or why it is missing from the result is important for many applications including auditing, debugging data and queries, and answering hypothetical questions about data. Both types of questions, i.e., why and why-not provenance, have been studied extensively. In this work, we present the first practical approach for answering such questions fo...
متن کاملUnderstanding Collaborative Studies through Interoperable Workflow Provenance
The provenance of a data product contains information about how the product was derived, and is crucial for enabling scientists to easily understand, reproduce, and verify scientific results. Currently, most provenance models are designed to capture the provenance related to a single run, and mostly executed by a single user. However, a scientific discovery is often the result of methodical exe...
متن کاملSGProv: Summarization Mechanism for Multiple Provenance Graphs
Scientific workflow management systems (SWfMS) are powerful tools in the automation of scientific experiments. Several workflow executions are necessary to accomplish one scientific experiment. Data provenance, typically collected by SWfMS during workflow execution, is important to understand, reproduce and analyze scientific experiments. Provenance is about data derivation, thus it is typicall...
متن کاملSHARP: Harmonizing Galaxy and Taverna Workflow Provenance
SHARP is a Linked Data approach for harmonizing cross-workflow provenance. In this demo, we demonstrate SHARP through a real-world omic experiment involving workflow traces generated by Taverna and Galaxy systems. SHARP starts by interlinking provenance traces generated by Galaxy and Taverna workflows and then harmonize the interlinked graphs thanks to OWL and PROV inference rules. The resultin...
متن کامل